An error-correcting code framework for genetic sequence analysis
نویسندگان
چکیده
A fundamental challenge for engineering communication systems is the problem of transmitting information from the source to the receiver over a noisy channel. This same problem exists in a biological system. How can information required for the proper functioning of a cell, an organism, or a species be transmitted in an error introducing environment? Source codes (compression codes) and channel codes (error-correcting codes) address this problem in engineering communication systems. The ability to extend these information theory concepts to study information transmission in biological systems can contribute to the general understanding of biological communication mechanisms and extend the field of coding theory into the biological domain. In this work, we review and compare existing coding theoretic methods for modeling genetic systems. We introduce a new errorcorrecting code framework for understanding translation initiation, at the cellular level and present research results for Escherichia coli K-12. By studying translation initiation, we hope to gain insight into potential error-correcting aspects of genomic sequences and systems. Published by Elsevier Ltd. on behalf of The Franklin Institute.
منابع مشابه
Ancient DNA sequence revealed by error-correcting codes
A previously described DNA sequence generator algorithm (DNA-SGA) using error-correcting codes has been employed as a computational tool to address the evolutionary pathway of the genetic code. The code-generated sequence alignment demonstrated that a residue mutation revealed by the code can be found in the same position in sequences of distantly related taxa. Furthermore, the code-generated s...
متن کاملIs there an error correcting code in the base sequence in DNA?
Modern methods of encoding information into digital form include error check digits that are functions of the other information digits. When digital information is transmitted, the values of the error check digits can be computed from the information digits to determine whether the information has been received accurately. These error correcting codes make it possible to detect and correct comm...
متن کاملError Correcting Sequence and Projective De Bruijn Graph
Let X be a finite set of q elements, and n, K, d be integers. A subset C ⊂ X is an (n, K, d) error-correcting code, if #(C) = K and its minimum distance is d. We define an (n, K, d) error-correcting sequence over X as a periodic sequence {ai}i=0,1,... (ai ∈ X) with period K, such that the set of all consecutive n-tuples of this sequence form an (n, K, d) error-correcting code over X. Under a mo...
متن کاملError-Correcting Codes for a Steganography Application
Steganography is the process of inserting hidden messages in an unrelated transmission. An algorithm for steganography that uses an error-correcting code to embed messages in a carrier sequence is studied. A distortion limit theorem from information-theoretic analysis is given, and optimal code parameters are discussed.
متن کاملInvestigation of coding structure in DNA - Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03). 2003 IEEE International
We have all heard the term "cracking the genomic code", but is DNA a code in the information theoretic sense? The coined term "genetic code" maps nucleotide triplets (codons) to amino acids. However, this is in computer coding sense because a codon instruction is performed to output an amino acid sequence. In this paper, we examine methods to detect redundant coding structure in DNA. First, a f...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004